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The Timeline 




• Access to position, event, and state data 

• Lots can happen in 90 seconds 
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The Problem 


• Standard approach 

• Worry about what to 
avoid or approach & 
how to get there 

• Future events will 
change the outcome 


• Plosh: past matters 

• Mistakes must be undone 

• A move in the past may 
appear benign but game- 
winning 

• Past can be rewritten out 
from under you 
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Time Management 


if(ai_Obj == JOB J_UM50_PREVI0US_M0VE) 

{ 

PERFORM UNDO_LAST_UNIT_COMMAND; 

int time_of_command = perf_ret; 

num_undos_i ssued = num_undos_stored + l; //manually increase the number of undos for Ai 
PERFORM set_player_time (t i me_of_command) ; 

update_timel i ne = l; 

target = saf_ai_instant_updater: 

PERFORM SET^AC HRO NAL_F I E ld l; //set the flag to 1 
PERFORM S ET_P L AYER_TIM E_R ATE $RATE_PAUSED ;| 

>_ 
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Time Management: 
Instant Updating 


PERFORM GET_AC HRONAL_F I E LD SAF_AI_INSTANT_UPDATER; 
int timeline_updater = perf_ret; 

//only run this code if the post death timeline updater is not running 

if (!timeline_updater && real_time > (1 ast_tur n_real_time + |l * $tps)) //it's been at least l second since 

{ 

PERFORM GET_PLAYER_TIME_RATE player; 
if(perf_ret == $rate_paused^ 

{ 

PERFORM SET_PLAYER_TIME_RATE $RATE_NORMAL ; 

} 

else if(current = player_time && current % 3 = O) //only run this via a 7 's own timewave 

{ 
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Metaplanning 



Planning where each game state is a plan, 
and each action is a change to the plan 


Can minmax on plans! 
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Metaplanning Strength 


• Tight loop: leverage cached timeline 

• Full metaplanning: plans of plans 

• Replanning (e.g., P. Breimyer, P. Wurman. "PBA 
Using Proactive Search to Make A* Robust to 
Unplanned Deviations." AAAI. 2008.) 
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Planning 
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Metaplanning / Metaminimax 
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Metaplanning in Plosh: 1 st Pass 


• After every order, traverse timeline 

- evaluate time of interest (TOI) list 

- prioritize and focus on 

- earliest death 

- last focus/turn time 

- if more than 1 plosh, last idle time per plosh 
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Metaplanning in Plosh: 1 st Pass 


Evaluate and prioritize every instant on timeline, 
scored from 0-15 based on situation: 

• first possible danger 

• any danger point 

• the most dangerous point 

• highest number of upgrades 

• sudden changes in map paint coverage 
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1 st Pass Heuristics 


• If can't drop a bomb (recharging, limited ability to change 
things), then deprioritize 

• At least one second different from other TOIs 

• If cannot reach any enemy (they can't reach you), then 
deprioritize 

• Check for enemy (or own) bombs 

• Also check similarity to other TOIs - if too similar, take best 

• Earlier in timeline prioritized 

• Evaluate severity of bomb inverse of distance away 
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V 

2 nd Pass: Deep Dive 



• Focus on top (3-5) TOIs per Plosh 

• Jump to each TOI, execute over time areas 

• Evaluate top 2 best actions 
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2 nd Pass: Deep Dive 


- If death 

- Jump to time of death, undo previous move 

- Jump back to previous move, reevaluate 

- Repeat *entire* process 

- If plosh is in danger, evaluate at the current instant 

- if plosh not in danger, run to end of action 

- Don’t second guess metaplan 
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Undoing Death 
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Deep Dive: No Immediate Danger 


• Prefer less utilized plosh 

• Mark everything visible as safe/unsafe from bomb 
drop in four cardinal directions 

• Don’t bomb unless everything plosh cares about is 
safe 

• Account for bomb cascades 
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Bomb Safety 

int store = target->Pos1tion; 

//store this upgrade or sploosh if it's reachable by the plosh, and set its 'own bomb safe’ flag if it's safe from having 
//a bomb placed at the spot on which the plosh is non' 

int dir = QUERY 6ESTM0VE [plosh, SACTlON_MOVE] MIN [(query <_> target) * 1.2] WHERE [1]; 

if (di r && !dir [S bestmove_gave_up] && dir != $quer.y_exhausted) '."if plosh is able to reach this taroet 

{ 

tx = store[$xpos] ; 
ty = store [SYpos] ; 

int own_bomb_saf e = 0; //flag set if this position is safe from own bombs 

if(tx != px && ty != py) 

{ 

own_bomb_saf e = l; 

} 

else if (abs(tx - px) > own_bomb_range | | abs(ty - py) > own_bomb_range) 

{ 

own_bomb_saf e = l; 

} 

store [ SSTOREDOWNROMBSAF E] = own_bomb_saf e; 

if (t_rank == $UPGRADE_RANK) 

{ 

target = ($AF_Ai_STORED_UPOtADES + upgrade_count) ; perform set_achronal_field store; 
upgrade_count = upgr ade_count + 1; 
if (own_bomb_saf e) 

{ 

saf e_upgrade_count = saf e_upgrade_count + l; 

> 

1 


GDC 


<> 


GAME DEVELOPERS CONFERENCE' March 14-18, 2016 ■ Expo: March 16-18, 2016 #G0C16 


' mv ^ 




ARTIFICIAL 

INTELLIGENCE 

SUMMIT 



Deep Dive: Bomb Placing 

- If place bomb, look at escapes 

If upgrade easily reachable 

Safe reachable location 

Otherwise don’t bomb, just move 
(especially beginning of game on some maps) 

Prefer hitting adversary with bomb blast 
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Deep Dive: Movement 


• Find location with the least amount of 
paint in bomb coverage area 

• Randomize to remove contention 

• Evaluate likely bomb blasts on the way 
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Deep Dive: Movement 
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3 rd Pass: Selection & Execution 


- Pick highest score TOI (prefer bombing efficacy, upgrades, 
overall safety) 

- Validate impact of action on future plan 

• If adversary capabilities at position (bombs / territory) that will reduce 
plan strength, reevaluate plan with second best action 

• If adversary has strong position after both actions tried, try next best 
TOI 

- Execute command 

- If remaining moves, re-run entire process 
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Conclusions 


Memory is cheap if state storage efficient 

- Cache plans even if not making time manipulation 
game! 

Likely to work well if can predict some player choices 

Implicit iterative model of when Al should perform 
rare, costly actions 
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Questions? 
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